Corpus: mri_web_2011_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 86 95 96 99 99
1000 608 877 945 986 991
10000 3122 6440 8165 9219 9446
100000 12636 34802 54502 69294 74663
1000000 12636 34802 54502 69294 74663


Zipf's diagram for sentence endings


Gnuplot diagram

12915 msec needed at 2018-05-25 19:33